Efficient language model development for spoken dialogue recognition and its evaluation on operator's speech at call centers
نویسندگان
چکیده
While a language model for recognition of spoken dialogue is ideally built from a very large, specific-task-oriented corpus, a great amount of time and effort is required to develop such a corpus, and this involves both the audio recording and written transcription of large amounts of speech data. Training data for a language model should match the target task in both topic and style. What is needed, then, is a method to utilize previously existing spoken dialogue corpora that are not necessarily related to the specific target-task. Such corpora would be combined with documents related to the topic of the target-task to develop a language model for the target spoken-dialogue. In this paper, we propose a method for combining previously existing corpora with key phrases (i.e. phrases that contain keywords) extracted from task related documents. Even though the added data is from documents related to the target dialogue, since it consists of key phrases, stylistic differences (between document data and the actual dialogue to which the model will be applied) are not a problem. We have produced a model using this method and have evaluated it in use on actual spoken dialogue collected at call centers. Experimental results show that a relative 13% reduction in word error rate could be achieved with the addition of key phrases. This performance is nearly as good as that which would be achieved on the basis of a large, expensive transcript-corpus, and the cost of producing the key phrase data is essentially negligible. Such cost reduction achieved by our method will enable speech recognition applications to be more widely used.
منابع مشابه
Data-Driven Language Understanding for Spoken Language Dialogue∗
We present a natural-language customer service application for a telephone banking call center, developed as part of the AMITIES dialogue project (Automated Multilingual Interaction with Information and Services). Our dialogue system, based on empirical data gathered from real call-center conversations, features data-driven techniques that allow for spoken language understanding despite speech ...
متن کاملTowards best practice in the development and evaluation of speech recognition components of a spoken language dialog system
Spoken Language Dialog Systems (SLDSs) aim to use natural spoken input for performing an information processing task such as call routing or train ticket reservation (Lamel et al., 1995). The main functionality of an SLDS are speech recognition, natural language understanding, dialog management, response generation and the speech synthesis. This article summarizes key aspects of the current pra...
متن کاملThe AMITIÉS System: Data-Driven Techniques for Automated Dialogue1
We present a natural-language customer service application for a telephone banking call center, developed as part of the AMITIÉS dialogue project (Automated Multilingual Interaction with Information and Services). Our dialogue system, based on empirical data gathered from real call-center conversations, features data-driven techniques that allow for spoken language understanding despite speech ...
متن کاملAutomatic Spoken Language Identification Utilizing Acoustic and Phonetic Speech Information
Automatic spoken Language Identification (LID) is the process of identifying the language spoken within an utterance. The challenge that this task presents is that no prior information is available indicating the content of the utterance or the identity of the speaker. The trend of globalization and the pervasive popularity of the Internet will amplify the need for the capabilities spoken langu...
متن کاملAn understanding strategy based on plausibility score in recognition history using CSR confidence measure
Although car-navigation systems attract attention as one of spoken dialogue interfaces, recognition errors due to the influence of natural speech and surrounding noise may prevent a smooth dialogue and disappoint the user. Thus, this research aims at the construction of a dialogue system which can achieve a smooth dialogue and a high degree of user satisfaction. Our system performs language und...
متن کامل